Accelerating AI inferencing with external KV Cache on Managed Lustre
cloud.google.comยท4h
๐๏ธLLM Infrastructure
Flag this post
Building Up And Sanding Down
endler.devยท20h
๐ชPrompt Engineering
Flag this post
Rearchitecting Vector Search: A Migration from MongoDB Atlas to Qdrant
pub.towardsai.netยท13h
๐ฏQdrant
Flag this post
How We Saved 70% of CPU and 60% of Memory in Refineryโs Go Code, No Rust Required.
๐ฌRust Profiling
Flag this post
Down with template (or not)!
cedardb.comยท20h
๐ฆRust Compiler Internals
Flag this post
Your AI Models Arenโt Slow, but Your Data Pipeline Might Be
thenewstack.ioยท2h
๐Model Serving Economics
Flag this post
How Distributed ACID Transactions Work in TiDB
pingcap.comยท4h
๐๏ธFoundationDB
Flag this post
๐ง ๐ Excited to introduce Supervised Reinforcement Learningโa framework that leverages expert trajectories to teach small LMs how to reason through hard problems ...
threadreaderapp.comยท18h
๐๏ธLLM Infrastructure
Flag this post
Andrew Shindyapin: AIโs Impact on Software Development
skmurphy.comยท17h
โกDeveloper Experience
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
๐ฅGPUs
Flag this post
ImapGoose status update: v0.3.2
whynothugo.nlยท6h
๐พPrompt Caching
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท11h
๐ก๏ธAI Safety
Flag this post
Intel to Compete with Broadcom and Marvell in the Lucrative ASIC Business
semiwiki.comยท7h
๐ปChips
Flag this post
Loading...Loading more...